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SECTION  1 
INTRODUCTION 

The  classical  theory  of  optimal  sensor  signal  processing  is  based  on 
statistical  estimation  and  hypothesis  testing  methods  [1].  The  salient  fea¬ 
tures  of  classical  signal  processing  theory  is  that  all  sensor  signals  are 
implicitly  assumed  to  be  available  in  one  place  for  processing.  In  recent 
years,  however,  there  has  been  an  increasing  Interest  in  distributed  sensor 
systems.  This  interest  has  been  sparked  by  large-scale  systems  such  as  power 
systems,  surveillance  systems,  etc.,  where  because  of  considerations  such  as 
cost,  reliability,  survivability,  communication  bandwidth,  compartmentaliza- 
tion,  or  even  problems  caused  by  flooding  a  central  processor  with  more  infor¬ 
mation  than  it  can  process,  there  is  never  centralization  of  information  in 
practice.  Thus,  extensions  are  needed  to  the  classical  framework  of  detec¬ 
tion  and  estimation  theory  if  it  is  to  be  relevant  to  the  design  of  distrib¬ 
uted  sensor  systems.  Such  extensions  need  to  take  into  account  (besides  the 
decentralization  of  information)  Issues  like  the  timeliness  as  well  as  the 
accuracy  of  the  decisions  of  the  detectors.  To  illustrate  these  Issues,  con¬ 
sider  the  following  example  from  surveillance  systems. 

Consider  two  airplanes  flying  over  a  surveillance  area  and  having  to 
decide  by  some  finite  time  T  whether  or  not  there  are  targets  in  that  area. 

The  airplanes  do  not  communicate  so  that  they  an;  not  easily  detected.  The 
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longer  the  airplanes  take  measurements  the  better  is  the  quality  of  informa¬ 
tion  they  collect  and  the  more  reliable  are  their  decisions.  On  the  other 
hand,  the  longer  the  airplanes  remain  over  the  surveillance  area  the  bigger 
are  the  chances  that  they  will  be  detected  and  the  longer  is  the  time  they 
cannot  be  used  for  some  other  purpose.  Thus,  accumulation  of  information  is 
costly  and  there  is  a  tradeoff  between  the  quality  of  information  upon  which 
the  decisions  of  the  airplanes  are  based  and  the  timeliness  of  their  decisions. 

Such  an  example  captures  a  lot  of  the  basic  features  of  distributed 
detection  problems.  The  airplanes  are  the  two  sensors  each  one  with  its  own 
information.  The  presence  or  nonpresence  of  targets  in  the  area  can  be  repre¬ 
sented  by  two  hypotheses  h  (h  -  0,1).  The  final  assessment  of  the  two  air¬ 
planes,  as  to  whether  targets  are  present  or  not,  can  be  represented  by  their 
decisions  ui(u£  -  0,1  i  -  1,2).  Since  information  is  costly  we  assume  that 
each  measurement  (after  the  initial  one)  taken  by  each  airplane  costs  c.  The 
fact  that  the  airplanes  have  the  same  goal  (i.e.  detect  targets)  can  be  repre¬ 
sented  by  a  common  terminal  cost  JCu^,  u2>  h)  which  couples  their  decisions. 

The  overall  objective  is  to  minimize  the  cost  due  to  the  measurements  and  the 
penalty  due  to  errors  in  detection.  Thus,  there  is  a  tradeoff  between  the 
quality  of  information  upon  which  the  decisions  u^  are  based  and  the  timeli¬ 
ness  of  these  decisions.  The  requirement  that  the  airplanes  do  not  communi¬ 
cate  can  be  incorporated  as  an  information  constraint  on  the  decision  problem. 

The  example  described  above  is  one  of  the  simplest  distributed  detection 
problems;  it  can  be  formulated  as  a  decentralized  Wald  problem  and  serves  as 
the  starting  point  of  a  detection  theory  for  distributed  sensors  described  in 
this  report. 
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SECTION  2 

OVERVIEW  OF  PREVIOUS  RESEARCH 

The  problem  of  constructing  decentralized  estimation  and  hypothesis  test¬ 
ing  rules  can  be  viewed  in  the  framework  of  decentralized  stochastic  optimal 
control.  Decentralized  stochastic  control  problems  have  been  studied  over 
the  past  15  years  (see  [2]  and  references  therein).  The  investigation  of 
these  problems  has  shown  that  they  are  in  general  very  complicated.  Even  the 
classes  of  decentralized  stochastic  control  problems  which  are  the  easiest  to 
analyze  (such  as  LQG  static  team  problems  [3],  or  team  problems  with  partially 
nested  information  structure  [4])  have  solutions  which  are  considerably  more 
complicated  than  the  solution  of  the  corresponding  centralized  stochastic 
control  problems.  This  characteristic  carries  over  to  decentralized  estima¬ 
tion  and  detection  problems. 

There  are,  however,  static  or  quasistatic  decentralized  estimation  and  detec¬ 
tion  problems  for  which  the  solution  has  been  analytically  derived.  Tenney 
and  Sandell  [5]  considered  the  first  simple  distributed  detection  problem 
where  there  are  two  hypotheses,  denoted  0  or  1 ,  and  two  detectors.  In  their 
formulation,  Tenney  and  Sandell  assumed  that  the  detectors  have  a  common  ob¬ 
jective  (i.e.,  their  detection  problems  are  coupled  through  the  cost)  and  each 
detector  takes  one  measurement  (or  a  set  of  measurements)  and  makes  a  decision 
based  on  his  own  information.  The  measurements  of  the  detectors  are  assumed 


to  be  Independent  conditioned  on  the  hypothesis.  Under  these  assumptions  it 
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was  shown  ([5])  that  Che  team  optimal  strategies  of  the  two  detectors  are 
described  by  thresholds  which  are  determined  by  the  solution  of  two  coupled 
nonlinear  algebraic  equations. 

Lauer  and  Sandell  [6] -[10]  extended  the  results  of  [5]  to  the  case  of 
correlated  waveform  observations.  They  found  that  in  general  the  determina¬ 
tion  of  the  optimal  decision  rules  of  the  two  detectors  requires  the  solution 
of  two  coupled  nonlinear  functional  equations.  Then,  Lauer  and  Sandell  exam¬ 
ined  several  special  cases  and  suboptimal  approaches.  For  the  special  case 
of  detecting  linearly  dependent  signals  in  white  noise,  they  determined  that 
the  local  likelihood  ratio  is  a  sufficient  statistic  for  detection  and  they 
computed  numerical  examples  for  the  case  where  the  signal  is  a  random  process. 
They  examined  a  suboptimal  solution  consisting  of  local  likelihood  ratio  tests 
with  jointly  optimized  thresholds,  and  obtained  results  for  a  number  of  inter¬ 
esting  cases. 

Ekchlan  [11]  considered  a  problem  similar  to  that  of  Tenney  and  Sandell 
[5]  but  assumed  in  addition  that  a  unidirectional  communication  link  exists 
between  the  two  detectors.  Ekchlan  found  that  the  team  optimal  decision  rules 
of  the  two  detectors  are  described  by  thresholds ,  the  computation  of  which  is 
coupled.  In  addition,  he  found  that  the  detector  receiving  the  communication 
uses  one  of  two  thresholds  depending  on  the  decision  of  the  other  detector. 

The  work  on  distributed  detection  reported  in  [5]— [11]  assumes  a  model 
with  static  hypotheses  (i.e.,  the  true  hypothesis  does  not  change  with  time) 
and  static  observations  (i.e.,  the  detectors  take  one  measurement  or  a  set  of 
measurements  and  make  a  decision). 

Teneketzis  [12]  considered  a  distributed  detection  problem  with  static 
hypotheses  and  dynamic  observations  (i.e.,  at  each  instant  of  time,  each 
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detector  can  either  stop  and  make  a  decision  or  request  more  Information  at 
some  cost).  Teneketzis  [12]  formulated  a  finite  horizon  decentralized  optimal 
stopping  problem  with  two  hypotheses  and  two  detectors,  which  is  the  decen¬ 
tralized  version  of  Wald's  problem.  He  found  that  the  optimal  decision  rules 
of  the  two  detectors  are  described  by  thresholds.  The  thresholds  of  the  two 
detectors  are  time  varying  and  coupled,  and  are  determined  by  the  solution  of 
4N-2  nonlinear  algebraic  equations  in  4N-2  unknowns,  where  N  is  the  horizon 
of  the  problem. 

Subsequently,  Teneketzis  and  Varaiya  [13]  solved  a  distributed  detection 
problem  with  dynamic  events  (i.e.,  the  case  in  which  the  true  hypothesis 
changes  with  time)  and  dynamic  observations.  They  formulated  an  infinite 
horizon  decentralized  optimal  stopping  problem,  with  two  hypotheses  and 
detectors,  which  is  the  decentralized  version  of  a  quickest  detection  pre  >1 
They  found  that  the  optimal  decision  rules  of  the  two  detectors  are  described 
by  time-varying  thresholds  which  can  be  determined  by  the  solution  of  two 
coupled  dynamic  programming  equations. 

Kushner  and  Pacut  [14]  studied  a  decentralized  detection  and  coordina¬ 
tion  problem  via  simulation.  They  considered  two  hypotheses,  0  or  1,  and 
two  detectors.  Each  detector  takes  an  observation  at  time  1  and  may,  if  it 
wishes,  take  an  observation  at  time  2.  The  second  observation  costs  C.  The 
detectors  do  not  communicate  with  each  other.  At  the  end  of  its  "observation 
period"  each  detector  transmits  its  conditional  probabilities  of  the  hypoth¬ 
eses  to  a  coordinator  who  then  computes  the  posterior  probability  and  decides 
on  either  0  or  1.  Kushner  and  Pacut  [14]  investigated  the  effects  of  prior 
probabllitiy  and  parametric  dependencies  on  the  decision  rules,  as  well  as 
sensitivity  to  the  data,  asymmetries  in  the  design  rules  and  other  phenomena. 
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The  problems  studied  in  [5]-[14]  are  decentralized  detection  problems. 

Barta,  (15J,  first  formulated  a  decentralized  estimation  problem.  He 
considered  two  agents  (estimators)  who  have  access  to  different  information, 
do  not  communicate,  and  want  to  estimate  a  Gaussian  random  variable.  The 
estimation  rules  of  the  agents  were  coupled  through  the  cost  function.  Barta 
restricted  attention  to  linear  estimation  rules  and  derived  a  recursive  solu¬ 
tion  for  the  best  linear  estimates  of  the  agents. 

The  objective  of  the  problems  studied  in  [5]-[15]  was  the  determination 
of  the  optimal  decision  rules  of  the  agents  in  distributed  estimation  and 
detection  problems. 

Borkar-Varaiya  [16],  Tsitsiklis-Athans  [17],  Washburn-Teneketzis  [18] 
and  Teneketzis-Varaiya  [19]  considered  a  different  class  of  distributed  esti¬ 
mation  and  detection  problems.  Namely,  [16]— [ 19]  consider  a  set  of  distrib¬ 
uted  communicating  agents;  each  agent  makes  a  decision  (estimate,  detection, 
etc.)  according  to  a  fixed  prespecified  rule,  and  communicates  it  to  the  other 
agents  who  update  their  decisions  according  to  the  same  rule.  The  convergence 
and  agreement  of  the  decisions  (estimates,  detections,  etc.)  is  the  topic  of 
investigation  in  [16]  —  [19]  . 

Borkar  and  Varaiya  [16]  investigated  the  consensus  pioblem  when  the 
agents  generate  and  exchange  the  conditional  expectations  of  the  random  vari¬ 
able  they  want  to  estimate.  They  showed  that  the  agents'  estimates  eventually 
agree.  Tsitsiklis  and  Athans  [17]  extended  the  results  of  [16]  to  the  case 
where  the  objective  function  is  an  arbitrary  convex  function  of  the  decision 
variables  of  the  agents.  They  showed  that  when  the  decisions  generated  by 
the  optimal  decision  rule  are  communicated,  then  asymptotically  the  decisions 
of  the  distributed  agents  agree. 
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Washburn  and  Teneketzis  [18]  considered  a  setup  similar  to  that  of  [16] 
and  [17]  and  characterized  the  properties  which  any  decision  rule  must  satisfy 
in  order  to  result  in  agreement  among  the  communicating  agents. 

The  results  reported  in  [16]— (18]  assume  that  all  the  communicating 
agents  have  the  same  probabilistic  description  of  the  random  variables  they 
attempt  to  estimate  or  detect. 

Teneketzis  and  Varaiya  [19]  considered  the  consensus  problem  for  a  dis¬ 
tributed  estimation  problem  where  the  agents  have  a  different  probabilistic 
description  of  the  random  variables  they  want  to  estimate  and  where  they 
exchange  their  conditioned  expectations;  they  demonstrated  the  effect  of  dif¬ 
ferent  models  in  the  outcome  of  the  estimation  process. 
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SECTION  3 

SUMMARY  OF  RESEARCH  RESULTS 

During  Che  first  two  years  of  our  research  we  concentrated  on  distributed 
detection  problems  with  static  hypotheses  and  static  observations.  The  main 
features  of  these  problems  are:  There  are  two  or  more  detectors  and  two 
hypotheses;  the  true  hypothesis  does  not  change  with  time;  the  detectors  have 
access  to  different  information,  do  not  communicate,  they  take  one  observation 
and  make  a  decision;  the  decisions  of  the  detectors  are  coupled  through  their 
common  objective. 

The  results  of  our  research  on  distributed  detection  with  static  hypothesis 
and  static  observations  have  been  documented  in  [6]— [10] .  As  pointed  out  in 
Section  2,  Lauer  and  Sandell  ([6]— [10])  treat  distributed  detection  problems 
with  correlated  waveform  observations,  and  examine  optimal  and  suboptimal 
approaches  for  the  solution  of  these  problems. 

During  the  third  year  of  our  research  we  concentrated  on  distributed 
sequential  detection  problems.  The  main  features  of  these  problems  are: 

There  are  two  or  more  detectors  who  have  access  to  different  information  and 
have  to  detect  one  of  two  possible  events  as  quickly  and  as  accurately  as 
possible.  The  detectors  may  or  may  not  communicate.  If  they  communicate 
they  can  only  exchange  limited  information. 

We  have  formulated  and  studied  three  distributed  sequential  detection 


problems:  (1)  The  Infinite  Horizon  Decentralized  Wald  Problem,  (2)  An 
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Extension  of  the  Decentralized  Quickest  Detection  Problem  and  (3)  A  distrib¬ 
uted  sequential  detection  problem  with  limited  communication  and  ordered 
stopping  times. 

For  the  infinite  horizon  decentralized  Wald  problem  ([20])  we  derived 
the  qualitative  properties  of  the  optimal  decision  rules  of  the  detectors. 

We  showed  that  the  team  optimal  decision  rules  of  the  detectors  are  charac¬ 
terized  by  time  invariant  thresholds  whose  computation  requires  the  solution 
of  two  coupled  sets  of  dynamic  programming  equations.  Since  the  numerical 
solution  of  those  equations  is  difficult,  we  developed  a  suboptimal  algorithm 
which  captures  the  basic  features  of  the  optimal  solution,  and  it  is  easy  to 
implement.  The  study  of  the  qualitative  properties  of  the  optimal  solution 
of  the  infinite  horizon  decentralized  Wald  problem  appears  in  Appendix  B. 

The  description  of  the  suboptimal  algorithm  as  well  as  the  numerical  results 
derived  from  its  implementation  appear  in  Appendix  C. 

The  decentralized  Quickest  Detection  Problem  ([21])  studied  during  this 
year  is  an  extension  of  [13].  We  consider  two  detectors  who  take  independent 
noisy  observations  of  a  two-state  Markov  chain  and  have  to  decide  when  the 
chain  jumps  from  state  0  to  state  1.  The  decisions  of  the  detectors  are 
coupled  through  a  common  cost  function  where  delays  in  detection  of  the  jump 
as  well  as  false  alarms  are  linearly  penalized.  It  is  shown  that  the  opti¬ 
mal  decision  rules  of  the  detectors  are  characterized  by  thresholds.  These 
thresholds  are  time-varying  and  their  computation  requires  the  solution  of  two 
coupled  sets  of  dynamic  programming  equations.  Numerical  solution  of  these 
equations  is  very  difficult;  thus,  the  paper  provides  only  a  qualitative  char¬ 
acterization  of  the  optimal  decision  rules  of  the  detectors.  However,  a  com¬ 
parison  with  the  thresholds  of  a  class  of  centralized  finite  horizon  quickest 
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detection  problems  is  possible.  Such  a  comparison  reveals  the  nature  of  the 
coupling.  The  results  of  our  study  on  the  Quickest  Detection  Problem  are 
documented  in  Appendix  A  ([21]). 

Distributed  sequential  detection  problems  without  any  communication  among 
the  detectors  are  in  general  team  problems  with  static  information  structure. 
When  the  detectors  are  allowed  to  communicate  and  exchange  limited  informa¬ 
tion,  then  this  limited  communication  results  in  dynamic  team  problems  which 
are  not  sequentially  decomposable.  It  is  very  difficult  to  analyze  these 
problems,  that  is  why  good  suboptimal  solutions  are  necessary. 

We  considered  a  simple  distributed  sequential  detection  problem  with 
limited  communiction  and  ordered  stopping  times;  and  we  developed  a  subopti¬ 
mal  algorithm  which  is  simple  to  implement.  The  proposed  algorithm  is  very 
similar  to  the  one  proposed  for  the  Infinite  horizon  decentralized  Wald 
problem;  the  thresholds  used  are  the  same  as  in  the  suboptimal  algorithm  for 
the  Wald  problem;  a  simple  suboptimal  Interpretation  of  the  communicating 
message  is  suggested;  such  an  interpretation  avoids  the  difficulties  intro¬ 
duced  by  signaling.  Further  testing  of  the  proposed  suboptimal  algorithm  is 
needed.  The  formulation  of  the  distributed  sequential  detection  problem  with 
communication  as  well  as  the  suboptimal  algorithm  proposed  for  its  solution 
are  documented  in  Appendix  C  ([22]). 
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ABSTRACT 


Two  detectors  take  independent  noisy  observations  of  a  two-state  ({0,1}) 
Markov  chain  and  have  to  decide  when  the  Markov  chain  jumps  from  state  0  to 
state  1.  The  decisions  are  coupled  through  a  common  cost  function  where  de¬ 
lays  in  detection  of  the  jump  as  well  as  false  alarms  are  linearly  penalized. 
It  is  shown  that  the  optimal  decision  rules  of  the  detectors  are  character¬ 
ized  by  thresholds.  These  thresholds  are  time-varying  and  their  computation 
requires  the  solution  of  two  coupled  sets  of  dynamic  programming  equations. 
Numerical  solution  of  these  equations  is  not  possible;  thus  the  paper  pro¬ 
vides  only  a  qualitative  characterization  of  the  optimal  decision  rules  of 
Che  detectors.  However,  a  comparison  with  the  thresholds  of  a  class  of  cen¬ 
tralized  finite  horizon  quickest  detection  problems  is  possible.  Such  a  com¬ 
parison  shows  the  structure  of  Che  coupling. 

Key  Words:  Decentralized  Quickest  Detection,  Markov  chain.  Stopping 
Rules,  Dynamic  Programming,  Threshold. 


1.  INTRODUCTION 


Two  detectors  make  Independent  observations  of  a  Marko”  chain  xt  which 
jumps  from  state  0  to  state  1  at  some  random  time  8.  Each  detector  has  to 
detect  the  time  of  the  jump  based  on  its  own  noisy  measurements;  let  xj  be 
the  time  detector  i  declares  that  the  jump  has  occurred.  The  problem  is  to 
find  stopping  times  Tj  which  minimize  the  expected  cost  EJ(Xj,x2,0). 

If  the  cost  is  separable,  i.e.,  J(xlfx2,0)  »  J( x  , 0)+J(x2, 0) ,  then  the 
decisions  of  the  two  detectors  are  decoupled.  In  such  a  case  for  certain 
costs  Xj,  (see  [1],[2])  the  optimal  decision  X*  is  to  stop  and  declare  the 
jump  the  first  time  instant  the  probability  of  “false  alarm”  drops  below  a 
time-invariant  threahold  l±,  i.e., 


x 


* 


i 


min(t|Prob(0>t|Yi)  <  ii} 


(1-1) 


where  Y*  is  :he  information  available  to  detector  i  at  time  t.  The  threshold 
t 

property  holds  for  the  cost  functions 


J( x , 0)  -  C( x, 0)  l(x>0)  +  l(x<0) 


(1-2) 


and 


J(x,0)  -  C( x-0)  l(x>0)  +  k( 0-x)  1 ( x< 8)  (1-3) 


If  the  cost  J(x1,x2,0)  is  not  separable,  then  there  is  an  interaction 
between  the  optimal  decisions.  Detection  problems  with  nonseparable  costs 
have  been  previously  Investigated  in  [3)-[8].  The  problem  investigated  in 


this  paper  has  a  nonseparable  cost  but  is  essentially  different  from  those  of 
1 5 ) —  1 8 1  which  are  not  sequential;  it  is  also  different  from  the  problem  of  [A] 
where  the  Markov  chain  is  frozen  in  one  of  two  states;  it  is  similar  to  the 
problem  considered  in  [ 3 [ ,  but  is  has  a  cost  function  different  from  that  of 
[3].  The  cost  function  considered  in  [3]  was 

j(vve)  ■  c(v0)  i(xi>e) +  c(t2"9)  ^v0)  +  ^v0)  ^v0) 

(1-4) 

This  cost  puts  a  constant  penalty  for  false  alarms  and,  for  each  detector,  a 
penalty  proportional  to  the  delay  in  detecting  the  jump. 

The  cost  function  considered  in  this  paper  is 

J(vv0)  ■  k(0-V0-T2)  1(T1<0)  Uv0)  +  c(v0)  ^V6)  +  c(v0)  Uv0) 

(1-5) 

This  c«st  puts,  for  each  detector,  a  penalty  proportional  to  the  delay  in 
detecting  the  jump  and  a  penalty  proportional  to  how  early  false  alarms 
occurred.  Thus,  the  coupling  of  the  detectors  (described  by  the  term 
k(0— tj+O-Tj)  1(Tj<0)  1(t2<0))  is  different  from  that  of  [3].  Because  of 
the  coupling  through  the  cost  function,  there  is  an  interaction  between  the 
optimal  decisions  of  the  detectors.  The  Interaction  is  simple  since  there 
is  no  communication  between  the  detectors. 

In  this  paper  we  show  that  the  member  by  member  optimal  (m.b.m.o.)  deci¬ 
sions  of  the  two  detectors  are  described  by  thresholds  as  in  [3].  These 
thresholds  are  time  varying,  as  in  [3],  and  their  computation  requires  the 
solution  of  two  coupled  sets  of  dynamic  programming  equations.  It  is  not 
possible  to  solve  numerically  these  equations,  thus  the  paper  provides  only 


a  qualitative  characterization  of  the  optimal  decision  rules  of  the  detectors. 
However,  it  is  possible  to  compare  the  m.b.m.o  thresholds  with  the  thresholds 
of  a  centralized  problem.  We  prove  that  for  each  instant  of  time  t  the  m.b.m.o. 
thresholds  lie  above  the  thresholds  at  t  of  class  of  finite  horizon  N  (N>t) 
centralized  quickest  detection  problems;  the  thresholds  of  these  centralized 
problems  are  time-varying.  A  similar  comparison  was  achieved  in  [3],  However, 
whereas  in  [3]  the  m.b.m.o.  thresholds  were  compared  to  a  stationary  threshold, 
such  a  comparison  is  impossible  for  the  present  problem. 

The  remainder  of  the  paper  is  organized  as  follows:  The  formal  model  is 
presented  in  Section  2.  The  characterization  of  the  m.b.m.o.  solutions  of  the 
decentralized  quickest  detection  problem  is  presented  in  Section  3.  In  Section 
4  the  comparison  of  the  m.b.m.o.  thresholds  with  the  thresholds  of  a  class  of 
finite  horizon  centralized  quickest  detection  problems  is  made.  Concluding 
remarks  appear  in  Section  5. 


2 .  THE  MODEL 

Consider  of  Markov  chain  {xt , t=l ,2 , . . . }  with  values  in  {0,1},  known 
transition  probabilities 


Prob(xt+i  =  l|xc=0)  =  p 


Prob(xt+i  =  l|xc-l)  =  1 


(2-1) 

(2-2) 


and 


Prob(x1=0)  =  1 


(2-3) 


Thus,  the  chain  makes  a  jump  to  state  1  at  the  random  time  8  _A  min {t :xc=,i } . 
Detector  i 's  observation  at  time  t  is 


y1  "  gi(x  ,wi)  i  -  1,2 
t  t  t 


(2-4) 


where  it  is  assumed  that  {w*},  i=*l,2,  are  mutually  independent  i.i.d.  sequences 

t 

which  are  also  independent  of  {x^}. 

The  problem  the  detectors  are  faced  with  is  the  following: 


Min  Min  E{k(8-T1  +  9-t2)  l(xTi=0)  l(xT2=0) 


1  2 


Ti-1 


t2-1 


+  C  l  l(x  =1)  +  C  l  l(x  =1 )} 
t-1  t»l 


(2-5) 


subject  to  Eqs.  2-1  through  2-4. 


20 


where  tj  is  a  Y*  stopping  time  and 
t 

Y1  A  o(yi,  s<t)  (2-6) 

t  “ '  s 

Problem  (P)  is  a  team  problem.  In  this  paper  we  shall  derive  certain  quali¬ 
tative  properties  of  the  solution  of  problem  (P).  We  shall  prove  that  the 
member  by  member  optimal  stopping  rules  of  the  detectors  are  characertized  by 
time  varying  thresholds  whose  computation  requires  the  solution  of  two  coupled 
sets  of  dynamic  programming  equations. 
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3.  CHARACTERIZATION  OF  THE  OPTIMAL  SOLUTION 


In  this  section  we  shall  characterize  the  member  by  member  optimum  solu¬ 
tions  of  problem  (P).  The  main  result  of  the  paper  which  we  shall  prove  in 
this  section  can  be  summarized  by  the  following  theorem: 

Theorem  3.1 

The  member-by-member  optimum  decision  rules  of  the  detectors  are  char¬ 
acterized  by  thresholds  as  in  the  case  of  a  single  detector.  However ,  the 
thresholds  are  time-varying  and  their  computation  requires  the  solution  of 
two  coupled  sets  of  dynamic  programming  equations. 

The  proof  of  this  theorem  proceeds  in  various  steps. 

Fix  t2— T2  (P°sslbly  at  tbe  optimum).  Then,  the  problem  faced  by  detector 


Min  EJ(Tj)  -  Min  E|k(8-T1  +  &-t*)  l(x^=0)  1(xt  »0) 
T1  T1  *  T2  Tl 

Tl_1  I 

+  C  l  l(x  -1)  } 

C-l  ) 


(3-1) 


Define 


u1  A  Prob(x  =0|Yl)  -  Prob(e>t|Y1) 


(3-2) 


Then,  we  can  alternatively  write  the  cost  in  Eq.  3-1  as  follows: 


22 


where 


h1  -  k  l  a(i-p)*-1  p  e { i ( T*<e) | e=t+A } 
c  4-1 

t  " 

H  [  Prob( T*<r | 0>r)  +  l  Prob(T*<t+4| 0>t+i)(l-p)* 
r-0  2  4-1  2 


-  k  l  (1-p)*"1  Prob(0>T*| 0>t+4) 
4-1  2 


(3-4) 


and 

(z]t  -  max(0 , z)  (3-5) 

The  equality  of  the  cost  functions  in  Eqs.  3-1  and  3-3  is  shown  In  Appendix  A. 

Thus,  detector  1  has  to  determine  a  Y1  stopping  time  to  minimize  Eq.  3-3. 

t 

Since  the  detectors  do  not  communicate  and  T  is  fixed,  a  Y1  stopping  time 

2  t 

for  detector  1  can  be  determined  by  Dynamic  Programming.  If  H1  were  constant 

t 

then  the  problem  faced  by  detector  1  for  fixed  would  be  a  quickest  detec¬ 

tion  problem  similar  to  that  studied  in  (1],[2).  Since  H1  is  time-varying  it 

t 

is  necessary  to  study  the  value  function  of  the  dynamic  program. 

3.1  DYNAMIC  PROGRAMMING 

To  study  the  value  functions  we  first  consider  (as  in  [3])  a  finite 


horizon  problem. 


Finite  Horizon 


Fix  the  final  time  T  (T<°>)  and  consider  the  problem 


Minimize  EJ(t  )  (3-6) 

1<T  j<T  1 


The  dynamic  programming  argument  for  this  problem  shows  that  the  value  func¬ 
tion  V1T(x),  i.e., 

|  T‘-‘ 

V1T(W1)  .  min  E  Hi  l(x  =0)  +  C  l  l(x  -1) 
t  <  T i <T  (  T1  T1  r-t  r 


(3-7) 


is  obtained  by 


VlTUl)  „  h1  „1  (3-8) 

T  T 

vlT(1fl)  m  min {h*  x1  ,  (L  +  Cfl-x1)}  t<T  (3-9) 

where 

(LV)(xl)  -  /  V(A(xl,y))  q(y|*l)  dy  (3-10) 

q(y|x1)  -  x^l-p)  Pj(y)  +  x1  p  pj(y)  +  (1-x1)  P*(y)  (3-11) 

A(x!,y)  »  x^l-p)  P1(y)/q(yjp)  (3-12) 

0 

and 

Pl(y)  is  the  probability  density  of  the  measurement  y1  under  the  assumption 
1  t 

that  X[Mi.  The  term  x*  in  Eq .  3-9  describes  the  cost  incurred  by  the 

t 

decision  to  stop  at  time  t  and  the  term  CO-x1)  +  (l  V*T  )  (xl)  describes  the 

V  t+1 

cost  incurred  by  the  decision  at  t  to  continue  taking  measurements.  Thus,  it 
is  optimal  for  detector  1  to  stop  if  and  only  if 


The  value  function  V^T  (it)  has  the  following  important  property: 

t 

Lemma  3.1 

V1T  (*)  is  a  nonnegative  concave  function  of  it(t”l  ,2, . . .  ,T) .  (L  V1t  l(n) 
t  t+1 

is  also  a  nonnegative  concave  function  of  it.  (t**l  ,2 , . . .  ,T) 

Proof :  The  proof  is  the  same  as  that  of  Lemma  3.1  in  [3].  □ 


Lemma  3.2 


At  x-0 


At  w-1 


H1  it  <  C(l-ir)  +  (L  Vi*  )  (it) 
t  t+1 


H1  it  >  C(l-if)  +  (L  V1*  )  (it) 
t  t+1 


Proof :  See  Appendix  B.  □ 


(3-14) 


(3-15) 


Lemmas  3.1  and  3.2  imply  the  threshold  property  of  agent  1  's  optimal 
policy  for  fixed  (arbitrary)  and  for  the  finite  horizon  problem. 

Infinite  Horizon 

To  minimize  Eq.  3-3  let  t-*,“.  Since  the  set  of  stopping  times  of 
agent  1  increases  as  T  increases,  it  follows  that 

yl(T+l )  („1)  <  ylT  (nl)  (3-16) 

t  t 

and  since  the  value  functions  Vl<r  (it1)  are  nonnegative,  the  following  limit 


is  well  defined: 


i  ' 

I  B 

I  ■ 

I 


E  I 


B  k 


1 

V1  (ir)  A  lim  V1T  (n)  -  inf  V1?  (it)  (3-17) 

t  ”  t->«»  t  t  c 

We  can  show  that  the  value  function  V*  (it)  has  the  following  properties  1 

t 

Lemma  3.3 

The  value  function  V*  (it)  satisfies  the  equation 

t  i 

V1  (w1)  -  min  |  Hl  a1,  C(l-*1)  +  (l  V1  )  (it1)  \  t-1,2,...  (3-18). 

t  it  t+1  ) 

V1  (w)  as  well  as  fh  V1  )  (n)  are  nonnegative  and  concave  functions  of  *. 
t  t+1  i 

Moreover,  at  it-0  and  x-1 ,  Eqs.  3-14  and  3-15  hold  respectively. 

Proof :  Eq.  3-18  is  obtained  by  Eq.  3-9.  The  nonnegativity  and  concavity  of 

V1  (s)  and  (L  V1  )  (ir)  follow  from  Lemma  3.1.  The  inequalities  at  ir-0  and 
t  t+1  i 

it*l  follow  from  Lemma  3.2.  □ 

Furthermore,  it  is  possible  to  show  that  the  value  functions  have  the 
following  additional  property. 

Lemma  3.4 

The  value  functions  {v1  (w)}  are  the  unique  solution  of 

Zt(it)  -  min  |  H1  it,  C(l-n)  +  (L  Zt+i )  (it)  J  ,  t-1,2,...  (3-19) 

Proof :  See  13),  Theorem  3.2.  □ 

The  properties  of  the  value  function  V1  (n)  imply  the  following  charac- 

t 

terlzation  of  agent  1 's  optimal  policy  for  fixed  t2> 

Lemma  3.5 

For  fixed  x?  the  optimal  stopping  time  for  detector  1  is 

x*  -  min (t:  ir1  <  i1}  (3-20) 

*  t  t 
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(3-21) 


The  threshold  41  is  defined  by  the  solution  of: 
t 

H1  n  -  C(l-n)  +  (L  V1  )  (*) 
t  t+1 

Proof:  The  concavity  of  (l  V1  )  (if)  and  the  inequalities  3-14  and  3-15  imply 

t+1 

imply  that  the  functions  H1  w  and  C(l-w)  +  (L  V1  )  (*)  intersect  at  one  point. 

t  t+1 

The  solution  of  Eq.  3-21  defines  this  point.  From  Eq.  3-18  the  optimal  deci¬ 
sion  optimal  decision  rule  of  agent  1  at  time  t  is  to  stop  if  and  only  if 

V1  (ir)  -  H1  ir  (3-22) 

t  t 

Equation  3-22  gives  the  rule  (3-20)  QED.  □ 

We  are  now  in  a  position  to  prove  the  main  result  of  this  paper  which  was 
summarized  by  Theorem  3.1. 

Proof  of  Theorem  3.1 

Since  the  optimum  policy  of  agent  1  is  characterized  by  thresholds  for  any 

arbitrary  fixed  policy  of  agent  2,  it  will  also  be  characterized  by  thresholds 

when  the  policy  of  agent  2  is  the  optimum.  By  symmetry,  the  optimal  policy  of 

agent  2  is  characterized  by  thresholds.  Hence,  the  member-by-member  optimal 

policies  of  the  agents  are  characterized  by  thresholds  {i1*  ,  Jt2*}.  These 

t  t 

thresholds  are  determined  by  the  solution  of  a  set  of  coupled  dynamic  program¬ 
ming  equations  which  are  of  the  form 


Hl({*2*})  t1*  -  Cfl-*1*) 
t  t  t  c 

+  (L  v»  H*1*)  t-1,2,... 

t+I  t 

(3-23) 

l2*  -  C(l-*2*) 
2  t  t  t 

+  (L  vl  )(i2*)  t-1,2,... 

t+1  t 

(3-24) 

The  proof  of  Theorem  3.1  is  now  complete.  0 


It  is  not  possible  to  solve  numerically  the  set  of  coupled  dynamic  programming 

equations  described  by  Eqs.  3-23  and  3-24;  thus,  the  results  of  this  paper 

provide  only  a  qualitative  characterization  of  the  optimal  decision  rules  of 

the  detectors.  It  is  possible,  however,  to  compare  the  thresholds  {i1*  ,  42*} 

t  t 

with  the  thresholds  of  centralized  quickest  detection  problems  as  shown  in 
the  next  section. 


I 

4.  COMPARISON  WITH  A  CLASS  OF  CENTRALIZED  QUICKEST  DETECTION  PROBLEMS 

In  this  section  we  shown  chat  for  each  time  t  it  is  possible  to  compare 
^  the  m.b.m.o.  thresholds  of  problem  (P)  with  a  class  of  finite  horizon  cen¬ 

tralized  quickest  detection  problems.  The  thresholds  of  these  centralized 
problems  are  time-varying.  Thus,  the  results  obtained  in  this  section  are 
different  from  those  of  [3]  where  comparison  of  the  m.b.m.o.  thresholds  with 
a  time-invariant  thresholds  was  possible. 

Let  (i**  ,  A2)  be  two  m.b.m.o.  thresholds  for  problem  (P)  at  time  t. 
t  t 

Consider  a  finite  horizon  centralized  quickest  detection  problem  [1],[2]  with 
final  time  N>t  and  cost 


K 

where 

I 


(  T— 1 

J(t,0)  >E  GN  1(T<0)  +c  l  l(xt*l) 

(  T  t=l 


gn 

t 


1 1 

i  l  1  l<1' 

(  b  1=1 


p  )  *_  1  p+N+2- 
P 


and 


N 

b  -  l  i(l-p)*-1  p 
1=1 


(4-1) 


(4-2) 


(4-3) 


It  is  well  known,  (1],(2],  chat  Che  optimal  stopping  rule  for  the  above 

r  mi  N 

problem  is  described  by  thresholds  |ANj  which  are  determined  by  the  solu¬ 
te  t  =  l 

cion  of  a  set  of  equations  in  the  form 
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I 


CN  xN  «  c(l-XN)  +  [L  mN  ]  (XN) 
c  t  c  c+l  t 


(4-4) 


where  MN  (w)  is  defined  by 
t 


Mn  (w)  -  k 
N 


M**  (*)  “  min 
C 


JgN  tt  ,  C(l-n)  +  [L  Mn  ]  (*)  [ 
\  t  C+l  ) 


(4-5) 


.,N-1  (4-6) 


and  [L  mN  ]  ( n )  is  defined  in  a  way  analogous  Co  Eqs.  3-10  chrough  3-12. 

C+l 

The  main  resulc  of  chis  seccion  is  summarized  by  Che  following  cheorem: 


Theorem  4.1 

Lee  (f**  ,  A2*)  be  cwo  m.b.m.o.  chresholds  for  problem  (P)  ac  cime  C, 
c  c 

and  lec  XN  be  che  opcimal  chreshold  ac  cime  r  for  Che  cencralized  quickesc 
C 

dececcion  problem  wich  final  Cime  N  and  cosC  given  by  Eq.  4-1.  Then,  for 
all  C<N 


XN  <  *i*  i-1,2  . 

c  c 


(4-7) 


The  proof  of  Che  cheorem  proceeds  in  various  seeps.  Firsc,  we  prove  Che 
following  lemmas: 


Lemma  4.1 


For  any  c,s  ,  e>s 


Gn  <  gn 
t  s 


(4-8) 


Proof:  Follows  direccly  from  Che  definicion  of  G^ . 

c 
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» 


a 


Lemma  4.2 


For  any  c,s  ,  t>s 


Hi  >  Hi  (1=0,2) 
t  s 


(4-9) 


where  Hi  is  defined  by  Eq .  3-4. 
t 


Proof:  See  Appendix  C. 


Lemma  4.3 


For  all  t<N, 


Hi  <  GN  ,  (i-1,2)  . 

t  t 


(4-10) 


Proof:  Follows  directly  from  the  definitions  of  Hi  and  G^. 

t  t 


We  are  now  ready  to  prove  Theorem  4.1. 


Proof  of  Theorem  4.1 


It  suffices  to  prove  that  for  any  stopping  rule  t|  the  corresponding 


threshold  of  Z1  satisfies, 
t 


An  <  Z1 
t  t 


(4-11) 


From  Lemma  3.5,  it  >  Z1  if  and  only  if 


t— 1 


it  >  (h1)'1  min  E  <  H1  l(xx=0)  +  C  l  l(xs»l) 
C  T>t  (  T  S=t 


(4-12) 


Because  of  Lemmas  4.1  through  4.3,  Eq .  4-12  gives 

„N 


>  min  E  I 


T— 1 


*  >  min  E  '  -  l(xT=0)  +  (GN)-1  C  l  l(sx=l) 

t<T<N  (  GN  t  s=t 


xt=Ti 


(4-13) 


»  ( 


>  t 


>  ( 


» 
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Equation  4-13  implies  that 


GN  n  >  mN(u)  (4-14) 

t  t 


Consequently 


41  >  AN 
t  t 


(4-15) 

QED. 


The  result  of  this  theorem  is  similar  to  that  of  [3].  However,  whereas 
in  13]  comparison  with  a  stationary  threshold  of  a  centralized  problem  is  pos¬ 
sible,  such  a  comparison  is  impossible  for  problem  (P).  Intuitively,  this  can 
be  explained  as  foiiows:  For  the  problem  considered  in  [3]  at  each  instant  of 
time  each  detector  has  to  account  for  an  average  penalty  due  to  a  false  alarm 
of  the  other  detector.  Such  a  penalty  varies  with  time,  however,  it  remains 
bounded  for  all  times  (actually  it  is  less  than  one).  This  feature  makes  pos¬ 
sible  the  comparison  of  the  decentralized  m.b.m.o.  thresholds  with  a  station¬ 
ary  centralized  threshold.  For  the  problem  considered  in  this  paper,  the  aver¬ 
age  cost  faced  by  each  detector  is  not  necessarily  bounded.  This  feature  of 
the  cost  makes  the  comparison  of  thresholds  a  more  subtle  problem;  comparison 
with  a  centralized  threshold  is  only  possible  if  that  threshold  corresponds  to 
a  cost  which  has  similar  features  as  the  average  cost  due  to  a  false  alarm  in 
problem  (P).  This  can  be  accomplished  if  one  restricts  attention  to  finite 
horizon  centralized  problems  which  result  in  time-varying  thresholds. 
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5.  CONCLUDING  REMARKS 


The  problem  considered  in  this  paper  is  similar  to  those  of  [ 3 ]  — [ 8 )  in 
that  thi^re  is  no  communication  between  the  two  detectors.  Even  though  the 
problem  treated  here  is  one  of  the  simplest  decentralized  sequential  detection 
problems,  the  coupling  induced  by  Che  cost  structure  causes,  (as  in  [3]  and 
[A]),  considerable  complexity  in  the  optimal  stopping  rules.  Computation  of 
the  optimal  thresholds  is  impossible  as  it  requires  the  solution  of  two  sets 
of  coupled  dynamic  programming  equations.  It  is  possible,  however,  to  lower- 
bound  the  optimal  decentralized  thresholds  by  the  thresholds  of  certain  finite 
horizon  centralized  quickest  detection  problems.  The  thresholds  of  these  cen¬ 
tralized  problems  are  time-varying.  A  similar  comparison  was  achieved  in  (3). 
However,  whereas  in  [3]  the  optimal  decentralized  thresholds  are  compared  to 
a  stationary  centralized  thresholds,  such  a  comparison  is  impossible  for  the 
present  problem. 

The  results  of  this  paper  provide  only  a  qualitative  characterization  of 
the  optimal  decision  rules  of  the  detectors.  The  qualitative  properties  of 
Che  optimal  decision  rules  as  well  as  the  comparison  between  the  optimal  de¬ 
centralized  thresholds  and  a  class  of  optimal  centralized  thresholds  provide 
the  basis  for  the  development  of  practical  algorithms  for  the  problem  studied 


in  this  paper. 


3 


APPENDIX  A 


Proof  of  the  Equality  of  Cost  Functions  (3.1)  and  (3.3) 


Consider  first  the  term 


!c  I 

{ *-i 


l(xA=l  ) 


(A-l) 


According  to  [1,  pp.  151-152],  Eq.  A-l  is  equal  to 


(l-nl) 

{  1=1  * 


Next  consider  the  term  E(k(0-t)  1 ( t < 0)  l(r*<0)}. 


(A-2) 


E{k(0-t)  1 ( t<0)  1 ( T*<0) |Y }  «  E {k( 0-t )  l(t<0)  hjCO) |Ylt }  (A-3) 


where 


hj(0)  =  E{1 (t*<0) | 0} 


(A-4) 


The  equality  of  Eq.  A-3  holds  because  the  observations  of  the  detectors  are 
independent  conditioned  on  the  time  of  the  jump. 

We  can  write 


E {k( 0-t )  1  ( t < 0 )  h  (0)|ylc}  =  k  l  h  (l)  U-t)  Prob(0=t|Y1t) 

Z=t+1 


k  l  h  (t +1)1  Prob(0=t+i|Y1t) 
1=1  1 


(A-5) 
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But 


h2(0)  =  E{(0-t*2)  1(t*<0)|0}  (A-l 1 ) 

Equality  in  Eq .  A-ll  holds  because  the  observations  of  the  detectors  are 
independent  conditioned  on  the  time  of  jump. 

Using  Eqs.  A-10  and  A-6  we  can  write 
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E  |  (  T2  )  »(TJ<0) 

«  (  t+4— 1 

=  [  S  I  (t+4-k)  Prob(T*=k 

4=1  (  k=0 


YlC  |  "  jj  E{(6"T2)  1(t*2<8)  9=t+i  }  (1_P)4~1  P  ”1 


0=t+fc)  >  (l-p)^"1  P  TT1 


«  (  t  +  i-1 

I  l 

1=1  (  k=0 


Prob(x*<k 


0-t+fc)  >  (l-p)^”1  P  TT1 


t 

l  Prob( x*<r 

CO 

0>t)  +  l  Prob( x*<t+4 

0>t+4)(l-p)* 

r=0  2 

4=1  2 

Note  that  when  r<t 


Prob( x*<r  | 6>t )  =  Prob( x*<r | 0>r) 


Consequently,  Eqs.  A-12  and  A-13  give 


(A-12) 


(A-13) 


|  (0-T*)  l(x*<0)  1 (t<0) |y [  = 

t 

l  Prob(x*<r 

{  1  ) 

r=0  2 

0>r) 


+  l  Prob( x*<t+4 


4=1 


0>t+4 ) ( 1-p)* 


(A-14) 


An  alternative  expression  for  kE{(0-x*)  l(x*<0)  l(t<0)|Yit}  can  be  derived 
as  follows: 


Using  Eqs.  A-6  and  A-10  we  can  write 


k  E  {(e-Tj)  l(T2<0)  J(t<0)  ylc  } 

oo 

=  k  l  e|(6-t*)  1(t*<0)  e*t+4  |  Prob(  0=t+Jl  0>s)  ir1 


k  e{( 

0-T*)  1  (  X 

k  e{( 

0-T*)  0>X* 

k  E  |  I 

0-X*)  0>T* 

k  e{( 

9-tJ  Vt)  + 

x  Prob  |  (  9— C  )>T*- 

■t  8-t>0 


where 


E(0)  +  e(t*  vt-r*)  Prob(0>T*-t)  it1 


t*  vt  -  max(x*,t) 


Because  of  Eqs.  2-1  and  2-2  we  can  write  Eq.  A-15  as 


j(e-T*)  1(t*<0)  1  ( t< 0 )  Yltj 

-k  i  ♦  t  -  e{,;  vt-T*}  sltl-p)"*"'1  j  »> 

-  k  U(t-B(,p)t  E|(l-p),,*'tl  J.l 


where 


(x]^  *  max(x,0) 


(A-15) 


(A-16) 


( A- 1 7  ) 


( A- 1 8 ) 


Combining  Eqs.  A-l,  A-2 ,  A-7 ,  A-9,  A-1A,  and  A-17  we  obtain  the  equality  of 
the  cost  functions  (3.1)  and  (3.3). 
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APPENDIX  B 


Proof  of  Lemma  3.2 

The  inequality  (Eq.  3-14)  at  if-0  is  true  because  C>0  and  (l  V1T  )  (if)  is 

t+1 

nonnegative . 

To  prove  Eq.  3-15  note  that  because  of  Eqs.  3-9  through  3-12 


(L  v!T  )  (*)  <  (l-p)  h1t  if 

t+1  t+1 


(  B — 1 ) 


Thus,  to  cotnplete  the  proof  of  the  lemma,  it  is  enough  to  show  that 


(l-p)  hit  <  Hn 
t+i  t 


(B-2) 


We  can  easily  show,  following  the  arguments  in  Appendix  A,  that 


H1T  -  -  Y  (l-p)A-l  Prob( 0>T* 
t  B  i-i  2 


k  C 

0>t+i)  +  -  l  Prob(T*<t 
B  i-o  2 


0>£  ) 


k  T-t 

+  "  l  Prob(  T*<t+i. 
B  A-l  c 


0>t+A ) (l-p)* 


(B-3) 


where 


B  -  j  (l-p)^-1  p 
1-1 


(B-4) 


The  Inequality  B-2  then  follows  directly  from  Eqs.  B-3  and  B-4.  QED 
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APPENDIX  C 


Proof  of  Lemma  4.2 
By  definicion  (Eq.  3-4) 


H1  -  k  l  (l-p)4'1  Prob( 0>t* | 0>t+l) 
t  1-1  2 


+  k  -  +  (t  -  E(T*))t 


]  EJ(.-P)<T‘-t: 


When  C>s ,  then 


fit  (s -E(T*»t  E  J(i_p)<,5-“)t  J 


It  remains  to  show  that 


Prob( 0>t* | 0>t+4 )  >  Prob(0>i*|0>s+l) 


(C-l) 


(C-2) 


(C-3) 


for  any  1,  when  t>s. 

But  Eq.  C-3  is  true  because  of  Corollary  4.1  of  [3].  Hence  H*  >  H*.  It  can 

t  s 

be  similarly  shown  that  H2  >  H2  QED. 

t  s 
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ABSTRACT 

A  Markov  chain  is  frozen  in  one  of  two  states  0,1.  Two  detectors  making 
independent  observations  must  decide  what  is  the  state  of  the  Markov  chain. 
The  observations  are  costly  and  the  decisions  of  the  detectors  are  coupled 
through  a  common  cost  function.  It  is  shown  that  the  optimal  decision  rules 
of  the  detectors  are  characterized  by  thresholds  whose  computation  requires 
the  solution  of  two  coupled  sets  of  dynamic  programming  equations.  Numerical 
solution  of  these  equations  is  very  difficult;  thus,  the  results  derived  in 
this  report  provide  only  a  qualitative  characterization  of  the  optimal  solu¬ 
tion  of  the  infinite  horizon  decentralized  Wald  problem. 
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SECTION  1 
INTRODUCTION 

In  recent  years  there  has  been  an  increasing  interest  in  distributed 
sensor  systems.  This  interest  has  been  sparked  by  large-scale  systems  such 
as  power  systems,  surveillance  systems  etc.,  where  because  of  considerations 
such  as  cost,  reliability,  survivability,  communication  bandwidth,  compart- 
mentalization,  or  even  problems  caused  by  flooding  a  central  processor  with 
more  information  than  it  can  process,  there  is  never  centralization  of  infor¬ 
mation.  Thus  distributed  estimation  and  detection  problems  have  recently 
received  considerable  attention  as  evidenced  by  [1]-[11]*. 

The  basic  features  of  the  distributed  estimation  and  detection  problems 
studied  so  far  are:  There  are  more  than  one  agents  (detectors,  estimators) 
which  have  access  to  different  information  and  have  either  to  detect  an  event 
(111— [10])  or  estimate  a  random  variable  ([11]).  The  agents'  decisions  (esti¬ 
mates)  are  coupled  through  their  common  objective.  The  agents  may  ((8))  or 
may  not  communicate  ([l]-[7],  (9)— [11]),  but  even  if  they  are  allowed  to  com¬ 
municate  they  cannot  exchange  their  raw  data  (so  that  there  can  be  no  central¬ 
ization  of  information).  In  addition,  Che  agents'  decisions  may  be  made  at 
fixed  instants  of  time  ([1]  —  [4]  ,  [8])  or  sequentially  in  time  (l5)-(7),  [9]). 
The  results  available  so  far  have  shown  that  the  distributed  information  pat¬ 
tern  and  the  coupling  of  the  decisions  through  the  cost  introduce  considerable 

♦References  are  indicated  by  numbers  in  square  brackets,  the  list  appears  at 
the  end  of  the  main  body  of  this  report. 
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complexity  in  the  -computation  of  the  optimal  decision  rules.  Nevertheless, 
the  same  results  have  revealed  a  lot  of  interesting  properties  of  the  optimal 
solutions  of  distributed  detection  and  estimation  problems;  these  properties 
can  be  used  to  guide  the  design  of  simple  suboptimal  algorithms  for  these 
problems . 

The  goal  of  the  present  report  is  to  extend  the  results  of  [ 5 J  to  the 
infinite  horizon  decentralized  Wald  problem.  The  results  derived  in  this 
report  provide  only  a  qualitative  characerization  of  the  optimal  solution  of 
the  infinite  horizon  decentralized  Wald  problem.  However,  the  basic  proper¬ 
ties  of  the  solution  derived  here  will  be  used  elsewhere,  [12],  to  guide  the 
design  of  a  simple  suboptimal  algorithm  which  is  easy  to  implement. 

The  report  is  organized  as  follows:  In  Section  2  the  formulation  of  the 
infinite  horizon  decentralized  Wald  problem  is  presented.  In  Section  3  a 
brief  summary  of  the  basic  results  of  the  finite  horizon  decentralized  Wald 
problem  is  presented.  These  results  are  extended  to  the  infinite  horizon 
problem  in  Section  A.  Conclusions  appear  in  Section  5. 
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SECTION  2 

PROBLEM  FORMULATION 

Consider  a  two-state  {0,1  }  Markov  chain  h  which  is  "frozen”  in  one  of  its 
states.  Assume  that 

Prob(h=0)  =  p  (2-1) 

Two  agents  (detectors)  attempt  to  detect  the  state  h  of  the  chain  by  taking 
noisy  observations  of  h.  The  observations  of  the  two  agents  are  described  by 

yi(t)  -  g^h,  \A(t))  i  =  1,2  (2-2) 

It  is  assumed  that  {v*(t)),  (v2(t)}  are  mutually  independent  i.i.d.  noise 
sequences  which  are  also  independent  of  h.  It  is  further  assumed  that  each 
observation  after  t=l  costs  c.  Let  uj(i  =  1,2)  be  the  final  decision  of 
agent  i  (uj  •  0  or  1),  and  Tj_  be  the  time  that  decision  was  made.  Then  the 
cost  incurred  by  these  decisions  is  assumed  to  be 

ctj  +  c t 2  +  J(Ul,  u2,  h)  (2-3) 

Under  these  assumptions  the  infinite  horizon  decentralized  Wald  problem 
is : 
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Minimize  EjcT^Yj)  +  CT2^'r2)  +  J^ui>  u2’  1 

y2er2 


Subject  to 

ui  A 


yi(y 


>  (p) 


y  1  A  [yid),  yi(2),...,yi(Ti)  ] 

Ti  A,  set  of  stopping  rules  that  are 

measurable  functions  of  the  data 
of  detector  i. 


J 


Problem  (P)  is  the  infinite  horizon  version  of  the  problem  studied  in  [5J  . 
In  the  next  section  we  shall  briefly  state  the  main  results  of  [5]  and  then 
we  shall  extend  them  to  solve  problem  (P). 
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Q 

SECTION  3 

REVIEW  OF  THE  FINITE  HORIZON  DECENTRALIZED  WALD  PROBLEM 

®  The  finite  horizon  decentralized  Wald  problem  formulated  in  [5]  is  the 

same  as  problem  (P)  with  the  additional  constraint 


<  T  ,  i  -  1,2  T  <  «  (3-1) 

i.e.,  the  agents'  decisions  have  to  be  made  no  later  than  T. 

It  was  shown  in  [5]  that  the  finite  horizon  decentralized  Wald  problem  is 
a  sequential  team  problem  with  static  information  structure;  thus  the  meraber- 
by-member  optimal  (m.b.m.o.)  strategies  of  the  two  agents  can  be  determined  by 
dynamic  programming  methods. 

The  main  results  obtained  in  [5)  are  the  following: 

Theorem  3.1 

The  m.b.m.o.  stopping  rules  of  the  two  agents  are  described  by  thresholds 

a1,  B1,  a2,  B2,  A1,  Jl2,  k  =  1,2,...,T-1  (T  =  horizon  of  the  problem).  The 
k  k  k  k  T  T 

thresholds  of  the  two  agents  are  coupled;  namely,  the  thresholds  of  agent  1(2) 
at  any  time  t  are  coupled  with  the  thresholds  of  detector  2(1)  at  all  times, 
past,  present  and  future.  The  computation  of  these  thresholds  requires  the 
solution  of  4T-2  nonlinear  algebraic  equations  in  4T-2  unknowns.  The  nonlin¬ 
ear  algebraic  equations  which  result  from  dynamic  programming  arguments  and 
whose  solution  determines  the  thresholds  for  agent  I  are  of  the  form 
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a*  l  p(u2|b=0)[cT2  +  J(l,  u 2,  0)] 


+  (l  -  aM  l  p(u2|h=l  )[ct2  +  J(l,  u2,  1  ) 
U2 


C  +  Ey,(k+D  r^h 


a1  p(y  (k+l)|h=0) 
k  1 


a1  p(y  (k+l)|h=0)+  (l-a1)  p(y  (k+l)|h«*l) 
k  1  k  1 


(3-2) 


S,1  l  p(u,|h=0)[cT  +  J(0,  u  ,  0)] 


k  “  ’  '  2 

U2 


i1 

T 


A(T) 

D(T) 


(3-4) 


A(T)  =  l  P(u2|h-I)[j(0,  1)  -  j(l,  u2,  l)]  (3-3) 

u2 

D(T)  «  A(T)  +  l  p(u  |h-0)[j(l,  u  ,  0)  -  j(0,  u2,  0)]  (3-6) 

U2 

In  Eqs.  3-2  and  3-3  V1T  is  the  value  function  of  the  dynamic  program  for 

k 

agent  l.  Similar  equations  hold  for  agent  2. 

The  conditional  probability  ti*  _A  p(h=0|y^t)  is  a  sufficient  statistic 

for  decisionmaking  for  agent  i  (i  *  1,2).  At  any  time  t  (t  =  1,2,...,T-1)  the 

action  of  agent  i,  based  on  the  statistic  ,  can  be  described  as  follows: 

t 
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k  a 


k  . 


G  K 


fc  k 


* 


1.  If  ni  >  0*,  declare  uj  =  0  and  stop. 

t 

2.  If  a1  <  71*  <  gi ,  continue  taking  measurements. 

t  t  t 

3.  If  tt ^  <  a* ,  declare  u-^  =  1  and  stop. 

t  t 

At  time  t=T  the  decision  of  agent  i  is: 

1.  If  n*  >  A*,  declare  u<  =  0 

t  T 

2.  If  it*  <  4*,  declare  u-i  =  1. 

t  T 

Thus,  the  stopping  times  t*  (i  =  1,2,)  have  the  following  property: 


Ti  »  min  it:  tt*  <  c4  or  tt*  >  £}*  i 

t  t  t  t  t 


If  there  is  no  t  such  that  Eq.  3-7  holds,  then 


Ti  =  T 


1,2  ,  1  <  t  <  t|  (3-7) 


(3-8) 

m 


The  coupling  of  the  thresholds  of  the  two  agents  arises  because  of  the 

presence  of  the  terms  p(u2|h=i)  in  Eqs.  3-3  through  3-6;  these  terms  are 

functions  of  (a2,  02,...,a2  ,  02  ,  i2). 

1  1  T- 1  T- 1  T 

The  value  functions  ViT(ni)  have  the  following  properties: 

k  t 

Lemma  3.1 


The  value  function  V ( tt  1 )  is  a  nonnegative  concave  function  of  u1  and 

t  t  t 

satisfies  the  following  equations: 


V*T(**)  *  mln  I"*  I  P  Cu2 1 h=0 )  J(°»  u  2»  °) 
U  2 


+  I  P  ( u  2  I  h”  1  )  J(0,  u2,  l)  , 


I  P(u,|h=0)  J(l,  u5,  0)  +  (l-if^)  l  p(ujh=l  )  J(l,  u2,  1  )j  (3-9) 


T  '2 
1  u 


T'  -  -2' 
1  u , 


,  u  ,  1)!  (3-5 
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VlT(,l)  =  min  {  min 

t  t  )u 


e{0,l  } 


I  P(u,lh’0)  u2,  0) 


+  (l-u1)  l  p(u2|h=l)  J(u1,  u2,  l) 
^  11  ** 


c  +  Ey i< t+i )  {v^j 


fc 


p(y  (t+1)  |h=0) 


_  I 

*  p(y^(t+l) |h=0)  +  (l  —  n  1 )  p(y  ^(t+1)  |h*=l  )/ j 


(3-10) 


Similar  properties  and  equations  hold  for  V2T(u2).  [Y] 


The  results  of  Theorem  3.1  and  Lemma  3.1  will  be  used  to  solve  the 


infinite  horizon  decentralized  Wald  problem. 
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SECTION  4 

SOLUTION  OF  PROBLEM  (P) 


Problem  (P)  is  the  infinite  horizon  version  of  the  problem  investigated 

in  15].  Thus,  in  order  to  solve  it,  it  is  sufficient  to  study  the  behavior 

and  properties  of  the  value  functions  ( *  1 )  i  =  1,2)  as  T  +  •.  Let  T  *  «. 

t  t 

Since  the  set  of  stopping  times  tj  {Tj_  <  T}  increases  with  T  it  follows  that 
for  all  t 

viT(Ti)  <  vi(t+l)(„)  i  =  1,2  . 

t  t 

Consequently  the  following  limit  is  well  defined: 

V*(u)  A  lim  viT(n)  A  Inf  V*T(  n)  =  Vi(ir)  (4-1) 

t  t  *"  t  c 

It  is  possible  to  show  that  the  value  functions  V*-(ir)  have  the  following 
properties : 


Lemma  4.1 

The  value  function  V^tt)  is  a  nonnegative  concave  function  of  h  which 
satisfies  the  equation 


V^it1)  =  min  <>  rain  jir1  \  p(u  |h=0)  J (u  ,  u  ,  0) 

(vl0,U  I  u2  \ 

+  (I"”1)  l  P(u2 1^=1  )  u2,  l)j  , 

«♦  J.-( - -‘--ly-'.|h:0) - )|j 

(  \*1  p  (y  j  I  h=o )  +  (l- n 1 )  p(yj|h-i)/j  ( 


(4-2), 
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Moreover . 


it1  p(y1|h=0) 


is  a  nonnegative  concave 


p(y ! i h=0 )  +  (1_7rl)  p (y j 

function  of  n1  and  the  following  inequality  holds  at  both  ti  1  =  0  and  n1  =  1 


I  nl  l  p(u2|h=0)  J(u1,  u2,  0]  +  (l-fl1)  l  p(u2|h=l  )  J  (u  ,  u2>  1  )  | 
UjCW.l }  (  u2  u2  ) 


min 


<  c  +  E. 


,W - -  - )\ 

(  x*1  p(yJh=o)  +  (1-n1)  p(y1|h=i)/j 


(4-3) 


Similar  properties  and  inequalities  hold  for  V2(n2). 


Proof ;  The  proof  that  Ey 


is  a 


L/ - - \j 

1  (  W  p(yjh=0)  +  (i-tt1  )  p(y  Jh=l )/  | 
nonnegative  concave  function  as  well  as  the  inequalities  at  tt1  *  0,  ir 1  =  1 
follow  directly  from  Lemma  3-2  of  (5).  v’(n)  then  is  nonnegative  and  concave 

as  the  minimum  of  nonnegative  concave  functions  QED. 

In  Eq.  4-2  the  term 


-  !  ir1  l  p(u2|h=0)  jfu^  u2,  0)  +  (l-n1)  l  p(u  |h»l  )  J  (u  ,  u  1  )  1 

UjEtO,!}  (  u2  u2  ' 


min 


( 


describes  the  cost  incurred  by  stopping  at  a  certain  time  and  the  term 


S  J  1,1  p(y  |h=0)  \) 

1  (  \  p(yJh=o)  +  (l-w1)  p(y  Jh-i )/  ) 

describes  the  cost  incurred  by  continuing  to  take  observations  at  that  time. 
This  fact  and  Lemma  4.1  can  be  used  to  prove  the  main  result  of  this  report 
which  is  given  by  the  following  theorem: 
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Theorem  4.1 

The  raember-by-member  optimal  strategies  of  problem  (P)  are  described  by 
thresholds  (a1,  B1 ,  a2,  62).  The  thresholds  of  the  two  detectors  are  coupled 
and  their  computation  requires  the  solution  of  the  following  set  of  nonlinear 
algebraic  equations: 

al  l  p(u2|h=0)  J(l ,  u2>  0)  +  (l-a1)  l  p(u2|h=i  )  j(l ,  u2>  1  ) 

u  „ 

2  2 

.  c  +  E  K'(  PCyl  lh-°)  \) 

Vl  I  Xa1  ptyjh-o)  +  (l-a1)  pEyJh-l)/)  (4-4) 


B1  l  P(u2|h-0)  J(0,  u2,  0)  +  (l-B1)  l  p(u2|h=l)  J(0, 


*  c  + 


'yi 


I  v,  (  6‘  \  I 

I  \s‘  p(y,|h*0)  +  (l-B1)  p [y J ] h- 1  )/  j 


(4-5) 


Equations  similar  to  4-4  and  4-5  hold  for  a2,  B2,  respectively. 


The  conditional  probabilities  n*  _A  Prob(h=0|yit )  are  sufficient  statistics  for 

decisionmaking.  At  any  time  t  the  action  of  agent  i  based  on  the  statistic  i4 

t 

can  be  described  as  follows: 

1.  If  i*  >  declare  u^  *  0  and  stop. 

t 

2.  If  a*  <  n*  <  3*,  continue  taking  measurements. 

t 

3.  If  it*  <  a*- ,  declare  m  *  1  and  stop. 

t 


Hence,  the  stopping  times  Tj  (i  ■  1,2)  of  the  two  agents  have  the  following 
property : 

tj  «  min  jt:  it*  >  B*  or  n*  <  a*  |  (4-6) 
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Proof :  The  threshold  property  of  the  m.b.m.o.  solutions  follows  directly  from 

Lemma  4.1.  Equations  4-4  and  4-5  follow  from  Eqs.  3-2,  3-3  and  4-1.  The 
action  an  agent  has  to  take  at  any  time  t  as  well  as  the  property  4-6  of  the 
stopping  times  follow  from  Eq.  4-2  and  the  interpretation  of  each  one  of 
the  terms  of  the  right-hand  side  of  Eq .  4-2.  QED 

The  threshold  properties  of  the  m.b.m.o.  solutions  resulting  from  Lemma 
4.1  and  Theorem  4.1  are  shown  in  Fig.  4-1.  The  coupling  of  the  thresholds 
results  because  of  the  presence  of  terras  p(u2|h=0)  and  p(u  |h*l)  which  are 
functions  of  (a2,  02).  In  general,  p(u2|h=0)  and  p(u2|h=l  )  are  very  compli¬ 
cated  functions  of  (a2,  82)  and  that  is  why  it  is  very  difficult  to  solve 
Eqs.  4-4  through  4-5  and  their  counterparts  for  a2,  82  numerically.  Thus,  the 
results  of  this  note  describe  only  the  qualitative  properties  of  the  m.b.m.o. 
solutions  of  the  infinite  horizon  decentralized  Wald  problem.  Nevertheless, 
the  qualitative  properties  of  the  m.b.m.o.  solutions  can  be  used  to  guide  the 
design  of  simple  suboptimal  algorithms  which  are  easy  to  implement.  Such  an 
algorithm  will  be  described  in  (12] . 
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SECTION  5 
CONCLUSIONS 

The  results  presented  in  this  report  extend  the  results  of  [5]  to  the 
infinite  horizon  decentralized  Wald  problem.  Even  though  the  infinite  horizon 
results  in  m.b.m.o.  solutions  which  are  described  by  stationary  thresholds, 
it  is  still  very  difficult  to  compute  these  thresholds  because  the  coupling 
of  the  agents  through  the  cost  results  in  extremely  complicated  dynamic  pro¬ 
gramming  equations  for  the  two  agents.  Thus,  the  design  of  suboptlraal  algo¬ 
rithms  which  take  advantage  of  the  qualitative  properties  of  the  m.b.m.o.  and 
are  easy  to  implement  is  necessary. 


I 


I 
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Q 

ABSTRACT 

Two  Sequential  Distributed  Detection  Problems  are  considered:  (1)  The 

a 

Infinite  Horizon  Decentralized  Wald  Problem  and  (2)  a  problem  with  communicat 
ing  detectors  and  ordered  stopping  times.  The  qualitative  properties  of  the 
optimal  solution  of  the  Infinite  Horizon  Decentralized  Wald  problem  are  known 
but  the  computation  of  the  optimal  solution  is  very  difficult.  Sequential 
Distributed  Detection  problems  with  communication  are  in  general  not  sequen¬ 
tially  decomposable,  therefore  very  difficult  to  solve. 

Suboptimal  algorithms  are  proposed  for  the  solution  of  the  problems  above. 
These  algorithms  combine  the  qualitative  features  of  the  optimal  solution  of 
^  the  Infinite  Horizon  Decentralized  Wald  problem  and  results  from  Sequential 

Analysis.  They  are  simple,  easy  to  implement  and  the  numerical  results 
obtained  are  intuitive. 
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SECTION  1 
INTRODUCTION 

Distributed  estimation  and  detection  problems  have  recently  received 
considerable  attention  ll]-[3],  (5]-[12}.  The  interest  in  these  problems  was 
sparked  by  large-scale  systems  such  as  surveillance  systems,  power  systems, 
etc.,  where  there  is  no  centralization  of  information. 

The  distributed  detection  problems  studied  so  far  can  be  classified  into 
(1)  static  problems  without  communication  [5]-[7],  (2)  static  problems  with 
communication,  and  (3)  sequential  problems  without  communication  [lj  —  J3J  ,  [8]. 

The  distributed  sequential  detection  problems  solved  in  [1]  —  [3]  ,  [ 8 }  are 
hypothesis  testing  problems  where  there  are  two  or  more  detectors  who  have 
access  to  different  information  and  have  to  detect  the  correct  hypothesis 
quickly  and  accurately.  The  detectors  do  not  communicate  but  they  all  have 
a  common  objective,  thus  they  are  coupled  only  through  their  cost  function. 
Even  in  this  simple  case  it  has  been  shown  (ll]-[3),  [8])  that  the  coupling 
through  the  cost  causes  considerable  complexity  in  the  computation  of  the 
optimal  stopping  rules  of  the  detectors.  More  specifically,  it  has  been  shown 
that  the  optimal  decision  rules  are  characterized  by  thresholds  whose  computa¬ 
tion  requires  the  solution  of  a  set  of  coupled  dynamic  programming  equations. 
Numerical  solution  of  these  equations  is  in  general  very  difficult,  hence 


*References  are  indicated  by  numbers  in  square  brackets,  the  list  appears  at 
the  end  of  the  main  body  of  this  report. 
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simple  suboptimal  solutions  which  exploit  the  qualitative  features  of  the 
optimal  solutions  are  needed. 

The  purpose  of  this  report  is  to  describe  a  suboptimal  solution  for  two 
sequential  distributed  detection  problems. 

1.  The  infinite  horizon  decentralized  Wald  problem;  and 

2.  A  distributed  sequential  hypothesis  testing  problem  with 
communication. 

The  report  is  organized  as  follows:  The  sequential  distributed  detection 
problems  mentioned  above  are  formulated  in  Section  2  and  their  basic  features 
are  discussed  in  Section  3.  The  proposed  suboptimal  algorithms  for  their 
solution  appear  in  Section  4.  Numerical  results  for  the  infinite  horizon 
decentralized  Wald  problem  appear  in  Section  5.  The  report  concludes  with  a 
summary  and  suggestions  for  further  research  which  appear  in  Section  6. 
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SECTION  2 

PROBLEM  FORMULATION 

^  Consider  a  hypothesis  testing  problem  where  one  of  two  events  Is  true, 

i.e.,  if  h  is  the  event  to  be  identified,  then  h  e  {0,1}. 

Consider  two  detectors  who  observe  the  event  in  a  noisy  environment;  the 
“*  observations  y*  of  detector  i  (i  *  1,2)  are  described  by 


y*(t)  -  h  +  v*(t) 


(2-1) 


It  is  assumed  that  (vl(t)}  (i  «  1,2)  are  independent  identically  distributed 
sequences  which  are  independent  of  each  other  and  independent  of  the  event  h. 
Thus,  conditioned  on  the  event,  the  observations  of  the  two  detectors  are 
independent.  The  noise  sequences  (v*(t)},  i  ■  1,2  are  zero  mean  normal  random 
variables  with  variance  o.  It  is  further  assumed  that  the  observation  at  time 
t  *  1  is  free  for  both  detectors,  but  each  additional  obsevation  is  costly. 

Let  C  be  the  cost  of  each  additional  observation  after  t  *  1 . 


Denote  by  u^  the  decision  of  detector  i  and  let  t^  be  the  time  this  decision 
I  was  made.  If  h  is  the  true  hypothesis,  then  the  cost  incurred  by  the  deci¬ 

sions  Uj ,  u2  is  assumed  to  be 


J^,  u2,  h) 


0  if  u  -  u  -  h 

1  if  u  *  u 
1  2 

k  if  Uj  =  u2*h  ,  k>l 


(2-2) 
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Under  these  basic  assumptions  (and  some  additional  ones  which  are  character¬ 
istic  to  each  problem)  the  infinite  horizon  decentralized  Wald  problem  and 
the  sequential  distributed  detection  problem  with  communication  and  ordered 
stopping  times  can  be  formulated  as  follows: 

2.1  THE  INFINITE  HORIZON  DECENTRALIZED  WALD  PROBLEM 

Consider  that  the  two  detectors  do  not  communicate;  thus,  each  detector's 
decision  is  based  only  on  the  data  available  to  it . 

Then,  under  all  the  above  assumptions  the  decentralized  Wald  Problem  is 

Minimize  £^1^  +  ct^  +  J(u^,  u^,  h)  j 

W  Vr2 

where  is  the  stopping  time  of  detector  i 

Ti "  VylTi) 

and 

ylTi  :  -  (y1(l) ,  yi(2),...,y1(T1)) 

2.2  A  SEQUENTIAL  DISTRIBUTED  DETECTION  PROBLEM  WITH  COMMUNICATION 
AND  ORDERED  STOPPING  TIMES 

Consider  that  detector  1  makes  a  decision  first  and  communicates  this 
decision  to  detector  2.  Then,  based  on  its  information  and  the  message 
received  by  detector  1,  detector  2  makes  a  decision  at  some  time. 

Under  the  basic  assumptions  and  the  assumption  above  a  sequential  detec¬ 
tion  problem  with  communication  and  ordered  stopping  times  can  be  formulated 


as  follows: 
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Minimize  E  (cij  +  ct^  +  JCuj,  u2>  H)  \  (2-6) 

W  Y2tr2 


where  Tj  is  the  stopping 


time  of  detector  i 

(2-7) 

(2-8) 


t2  >  T1 


and  yiTi  is  defined  as  in  Eq.  2-5. 


(2-9) 


I 

I 
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SECTION  3 
DISCUSSION 

Since  k  >  1  the  cost  J(Uj,  u2,  h)  is  in  general  not  separable  (i.e., 

J(Uj,  V  h)  *  J(ulf  h)  +  J(u2,  h)),  consequently  in  both  problems  the  deci¬ 
sions  of  the  two  detectors  are  coupled. 

The  absence  of  communication  between  the  detector  makes  the  decentralized 
Wald  problem  a  sequential  static  team  problem.  Thus,  the  member-by-member  op¬ 
timal  decision  rules  of  the  detectors  can  be  determined  by  backward  induction. 
It  was  shown  in  [3]  that  the  meraber-by-member  optimal  decision  rules  are  char¬ 
acterized  by  thresholds  whose  computation  requires  the  solution  of  two  coupled 
sets  of  dynamic  programming  equations.  Numerical  solution  of  these  equations 
is  very  difficult,  thus  a  simple  suboptimal  solution  is  desirable. 

When  the  detectors  communicate  then  the  problem  of  determining  the  opti¬ 
mal  stopping  rules  becomes  much  harder,  because  in  this  case  the  information 
of  one  detector  depends  on  the  stopping  rule  used  by  the  other  thus  resulting 
in  a  dynamic  team  problem  which  is  not  sequentially  decomposable.  That  is  why 
good  suboptimal  solutions  are  very  valuable. 

In  the  next  section  a  simple  suboptimal  solution  to  the  problems  above 
will  be  presented.  The  proposed  solutions  combine  the  features  of  the  member- 
by-member  optimal  solutions  of  the  decentralized  Wald  problem  with  some  stan¬ 
dard  results  from  statistical  sequential  analysis  [A). 
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SECTION  4 

A  SUBOPTIMAL  SOLUTION  FOR  THE  PROPOSED  PROBLEMS 

4.1  A  SUBOPTIMAL  SOLUTION  FOR  THE  INFINITE  HORIZON  DECENTRALIZED  WALD  PROBLEM 
The  basic  idea  of  the  proposed  suboptimal  solution  is  the  following. 

Let  0^(0^)  be  the  probability  of  error  of  type  1  for  detector  1(2)  (i.e., 
the  probability  that  if  h  =  0  detector  1(2)  will  declare  h  *  1);  similarly, 
let  be  tbe  Probabllity  error  of  type  2  for  detector  1(2)  (i.e.,  the 

probability  that  if  h  *  1  is  true  detector  1(2)  will  declare  h  *  0).  We  shall 
write  the  cost  (Eq.  2-4)  as  a  function  of  these  four  quantities  and  then  we 
shall  minimize  that  cost  jointly  over  o^,  c»2,  0^  After  c^,  a2,  0^,  02 

are  determined,  standard  results  from  statistical  sequential  analysis  will  be 
used  to  determine  the  thresholds  for  the  two  detectors  and  afterward  the  solu¬ 
tion  of  problem  (Q ^ )  will  be  determined  graphically. 

From  statistical  sequential  analysis  ([4])  it  is  known  that  the  average 
number  of  observations  required  to  reach  a  decision  with  errors  and  a2  is 
approximately 
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when  the  event  h  *  1  is  true. 

Relations  similar  to  Eqs.  4-1  and  4-2  hold  for  detector  2  with  0j  and  02 
in  place  of  and  c»2,  respectively.  Using  Eqs.  4-1,  4-2  and  2-3  we  can 
approximately  write  the  cost  to  be  minimized  as 


E  |cri  +  ct2  +  J(ut,  u2,  h) | 


2oc(l-p) 


1  "  %  a2  1  "  02 
(1  -  a2)log  - f  +  ct2  log  -  +  (1  -  &2)log  - 


1 


1  -  a 


1 


0 


1 


+  02  log 


1  -  0, 


-  2ocp 


1  -  c  a 

log  - f  +  (1  -  a^log  - - — 

a.  1  -  a. 


1-0  0 
+  0j  log  _ ?  +  (1  -  O^log  2 


0 


1 


1  -  0, 


+  (1  -  a2) 02  (1  -  p) 


+  a2(l  -  02)  (1  -  p)  +  a^(  1  -  0^p  +  (1  -  Oj)  01  p  +  k  aJ01  p 


+  k  a202(l  "  p)  :  ”  f(ai»  a2>  Bi»  B2J 


(4-3) 


k 


The  minimization  of  f(a  ,  a2>  @2)  over  these  four  parameters  gives  their 

optimal  values.  Then  the  relations 


1  -  a„ 


Aj  -  log 


(4-4) 


A2  -  log 


1  -  a. 


(4-5) 
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(4-6) 


(4-7) 


ft 

I 


from  standard  sequential  analysis  ([4])  can  be  used  to  determine  the  thresholds 

of  the  detectors.  After  the  thresholds  are  specified  by  Eqs.  4-4  through  4-7 

the  decisions  of  the  two  detectors  can  be  graphically  determined  by  Fig.  4-1. 

At  any  time  t,  the  sum  of  observations  £  y*(t)  up  to  that  time  is  a 

t 

sufficient  statistic  for  decisionmaking  for  detector  1.  As  long  as  the  sum 

£  y*(t)  remains  between  the  two  parallel  lines  A^,  (Fig.  4-1)  detector  1 

continues  to  take  measurements.  The  first  instant  of  time  the  sum  £  y*(t)  is 

t 

above  Jlj  or  below  detector  1  stops  and  accepts  h  »  1  if  £  yJ(t)  is  above 
Jlj  and  h  »  0  if  £  y*(t)  lies  below  Aj.  Similar  results  holds  for  detector  2. 

The  proposed  algorithm  is  suboptimal  because  the  expressions  4-1  and  4-2 
for  the  average  number  of  measurements  are  only  approximate;  the  exact  compu¬ 
tation  of  the  average  number  of  measurements  is  very  complicated.  Even  though 
suboptimal,  the  algorithm  proposed  above  captures  some  of  the  basic  features 
of  the  optimal  solution  of  the  infinite  horizon  decentralized  Wald  problem; 
namely,  the  decision  rules  of  the  two  detectors  are  described  by  thresholds 
and  the  thresholds  are  given  in  terms  of  the  probabilities  of  error  of  type  1 
and  type  2  which  are  determined  by  joint  optimization  for  the  two  detectors. 

The  joint  optimization  is  very  simple  as  it  only  requires  the  minimization  of 
Eq.  4-3  with  respect  to  a^,  a^,  £5^,  As  it  will  be  seen  in  Section  5,  the 

numerical  results  obtained  by  the  algorithm  are  intuitively  appealing. 
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Figure  4-1.  Detector  i’s  Decision  Rule 
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4.2  A  SUBOPTIMAL  SOLUTION  FOR  A  SEQUENTIAL  DISTRIBUTED  DETECTION  PROBLEM 
WITH  COMMUNICATION  AND  ORDERED  STOPPING  TIMES 

The  basic  idea  of  the  suboptimal  solution  for  the  problem  (Q^)  of  Section 
2  is  the  following: 

Let  ,  a^,  8^,  82»  c^e  probabilities  of  error  of  type  1  and  type  2 
for  detector  i  and  detector  2,  respectively.  Following  the  procedure  of  sub¬ 
section  4.1  we  can  determine  (off-line)  first  the  probabilities  c^,  a 2,  Bj,  B2, 
which  minimize  the  cost  (Eq.  4-3)  and  afterwards  the  thresholds  A(  a  ,  a^) , 

A2^ai*  a2^  Bi^i»  B2^1*  B2^  uslng  Eq*  through  4-7.  Then,  the  deci- 

of  the  two  detectors  can  be  graphically  determined  as  follows:  Detector  l's 
decision  is  determined  in  exactly  the  same  manner  as  in  subsection  4.1  (Fig. 
4-1).  Detector  2  uses  detector  l's  decision  as  well  as  the  time  that  decision 
was  made  and  treats  them  as  an  additional  observation  in  the  following  manner: 
If  the  decision  of  detector  i  is  Uj  =  1  and  is  made  at  t  =  t ^ ,  this  means 

that 


Tl 


l  ykt)  >  A  (a  ,  a  )  +  _i  ; 
t=l  1  1  z  2 


(4-8) 


then,  detector  2  treats  the  decision  u^  =  1  at  as  an  additional  measure- 

1 

ment  with  value  A(a  ,  a  )  +  -  T  ,  adds  it  to  his  previous  measurements  and 

12  2  1 

proceeds  from  that  point  on  to  make  a  decision  in  the  same  way  as  in  subsec¬ 
tion  4.1 . 

If  the  decision  of  detector  1  is  u^  =  0  and  is  made  at  t  =  s^,  this  means 

that 


S1  s 

l  y l(t)  <  A  ( a  ,  a  )  +  _i 
t=L  2 


(4-9) 


» 


> 


» 


I 


I 
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then,  detector  2  treats  the  decision  u^  =  0  at  s^  as  an  additional  measure- 

s 

ment  with  value  -  ^  A(a^,  a^)  +  _ -  )  >  a^ds  it  to  the  previous  measurements 

and  proceeds  from  that  point  to  make  a  decision  in  the  same  way  as  in  subsec¬ 
tion  4.1  (Fig.  4-1). 

The  suboptimal  algorithm  proposed  here  is  graphically  depicted  in 
Fig.  4-2. 

In  the  suboptimal  algorithm  proposed  above,  the  interpretation  of  the 
first  detector's  message  by  the  second  detector  is  very  easy  because  it  is 
assumed  that  the  stopping  rules  used  by  the  detectors  are  arbitrarily  fixed 
and  known  beforehand  to  both  of  them.  When  the  optimal  solution  to  problem 
(Q2)  is  sought,  then  the  interpretation  of  the  message  of  detector  1  depends 
on  the  stopping  rule  used  by  that  detector;  since  the  stopping  rule  has  to 
be  determined,  the  interpretation  of  the  message  by  the  second  detector  is  a 
highly  nontrivial  task  and  leads  to  optimization  problems  which  are  not  se¬ 
quentially  decomposable.  Since  it  is  in  general  very  difficult  to  solve  such 
problems,  simple  suboptimal  solutions  like  the  one  proposed  above  must  be 
investigated  and  evaluated. 
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Figure  4-2b.  Detector  2's  Decision  Rule 
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SECTION  5 
NUMERICAL  RESULTS 

In  this  section  the  numerical  results  obtained  by  the  implementation  of 
the  algorithm  proposed  in  subsection  4.1  are  presented.  The  probabilities 
of  error  (c^,  a2,  0^  02>  as  well  as  the  thresholds  A^Ca  ,  a2),  A^a^,  <*2) , 

02),  B  (0  ,  02)  have  been  computed  for  various  values  of  the  following 
parameters : 

1.  The  prior  probability  p  **  Prob(h  =  0); 

2.  The  variance  o  of  the  observation  noise; 

3.  The  cost  c  of  the  observations; 

4.  The  penalty  k  arising  when  both  detectors'  decisions 
are  wrong. 

We  shall  present  each  one  of  our  parametric  studies  separately,  and  we  shall 
interpret  the  results  obtained  by  these  studies.  Note  that  the  cost  (4-3)  is 
a  nonconvex  function  of  (c^,  a2>  0^  02) ,  thus  the  values  of  (c^,  a2>  0^,  02) 
determined  by  the  minimization  of  Eq.  4-3,  correspond  to  local  minima.  These 
minima  depend  on  the  initial  guess  of  (c^,  a2,  0^,  02>  used  in  the  minimiza¬ 
tion  of  Eq.  4-3;  as  it  will  be  shown  below  different  initial  guesses  give 
rise  to  different  solutions  for  c^,  a2>  0 L ,  02  and  A^c^,  a2) ,  A2(a  ,  a2> , 

B 1 C 0 j ,  02) ,  B2(0j,  02),  Some  of  the  local  minima  of  Eq.  4-3  result  In  a  *  0 
and  <»2  =  02<  Such  local  minima  are  obtained  when  the  minimization  of  Eq .  4-3 

is  initiated  with  a^-n  *  a^n  m  0^n  =  0^n.  The  numerical  results  we  present 
12  12 

below  correspond  to  minima  for  which  ,  a2  =  ^2  * 
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5.1  THE  VARIATION  OF  c^,  a2,  Bp  B2,  A^Oj,  a2) ,  A^c^,  a2) ,  B^Bj,  B2) , 
B2(Blt  B2),  AS  A  FUNCTION  OF  p,  k 

Figures  5-1  through  5-3  present  the  variation  of  the  probabilities  of 
error  of  type  1  and  2  as  well  as  the  variation  of  the  thresholds  as  functions 
of  the  prior  proability  p  for  fixed  c,  k,  o.  It  is  expected  that  as  p,  the 
prior  probability  of  h  »  0,  increases,  the  likelihood  that  the  final  decision 
will  be  h  =  1  decreases.  Using  the  results  of  [3]  we  can  justify  this  claim 
as  follows: 

The  final  decision  of  detector  i  is  h  =  1  only  if 

p(h  =  Ojylt)  <  2^  at  time  t  and 

£*■  <  p(h  ■  0|yic)  <  1*  for  1  <  s  <  t  , 

1  2 

where  l and  2*  are  the  thresholds  determined  in  13] .  As  the  prior  proba- 
12 

bility  p  of  h  »  0  increases  the  probability  of  the  sest  of  measurements  yit; 
that  could  cause  p(h  =  Oly*11)  to  drop  below  2*  decreases,  thus  decreasing  the 
probability  of  error  of  type  1.  On  the  contrary,  as  p  increases,  the  proba¬ 
bility  of  the  set  of  measurements  y^c  that  would  result  in  p(h  **  0|yic)  <  2* 

2 

increases,  consequently  the  probability  of  error  of  type  2  increases.  This 
is  indeed  shown  in  Figs.  5-1  and  5-2. 

The  thresholds  A^Oj,  a^) ,  A^o^,  a2>,  B^Bp  02>  ,  B^Sj,  ^  are  also  ex“ 
pected  to  vary  with  p  as  follows:  As  p  increases  each  detector  would  be 
biased  more  and  more  towards  declaring  h  =  0.  Consequently,  as  p  increases, 
the  area  where  h  ■  0  is  accepted  in  Fig.  4-1  would  increase  and  the  area 
where  h  *  1  is  accepted  in  Fig.  4-1  would  decrease;  hence,  the  thresholds 


ALPHATECH,  INC. 


Aj(c»i ,  a2)  ( B  j  ( B  j ,  02>)  and  A^Oj,  ap  (BjCSj,  02>  )  defined  by  Eqs.  4-4 
through  4-7,  should  both  increase  as  p  increases.  This  is  indeed  shown  in 
Fig.  5-3.  The  results  of  Figs.  5  1  through  5-3  assume  that  k  =  4,  c  »  0.1, 
o  =  0.5.  When  the  cost  of  observations  and  the  noise  variance  in  the  obser¬ 
vations  remain  unchanged  but  the  terminal  cost  k  due  to  two  errors  changes, 
then  it  is  expected  that  the  probabilities  c^,  a2>  0  ,  02  as  well  as  the 
thresholds  A^a^  a2>,  A^c^,  «2) ,  02> ,  B^B^  &2)  will  change. 

More  specifically,  it  is  intuitively  expected  that  as  k  increases  the  error 
probabilities  ,  a2>  0^  02  will  decrease  for  a  fixed  p.  Such  a  behavior  is 
expected  for  the  following  reason:  As  the  penalty  due  to  errors  increases, 
the  detectors  tend  to  become  more  conservative  and  more  cautious,  thus  they 
tend  to  base  their  decisions  on  more  reliable  information;  consequently,  the 
probability  of  error  is  reduced.  This  can  be  verified  by  comparing  Figs.  5-1 
and  5-2  with  Figs.  5-4  and  5-5.  It  can  be  seen  from  this  comparison  that  for 
the  same  p,  c,  o, 

aj(k  »  6)  <  a^(k  =  4) 

<*2(k  -  6)  <  a2(k  =  4)  . 

A  continuous  variation  of  the  probabilities  of  error  versus  k  is  shown  in 

Fig.  5-6.  (Some  of  the  local  minima  result  in  “  0j  «  02  when  p  -  0.5 

and  the  minimization  of  Eq.  4-3  is  initiated  with  a*n  =  a*n  «  0*n  »  0in. 

12  12 

Thus,  Fig.  5-6  describes  the  variation  of  a^,  c*2,  0^,  02  as  a  function  of  k) . 

As  far  as  the  behavior  of  the  thresholds  AjCc^,  a2) ,  A^Oj,  a2> ,  B  (0  ,  02> 
and  B2 ( 0 j ,  @2)  as  a  function  of  k  is  concerned,  we  can  expect  the  following: 
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Since  as  k  increases,  the  two  detectors  become  more  conservative  and  more 
cautious,  the  areas  of  Fig.  4-1  where  h  =  0  and  h  =  1  are  accepted  should 
get  smaller.  Consequently,  as  k  increases,  the  upper  threshold  A^c^,  a 2) 
(BjCSj,  62>)  should  increase  whereas  the  lower  threshold  A^c^,  a2) 

(B2(0^,  B2))  should  decrease.  This  behavior  can  be  verified  by  comparing 
Figs.  5-3  and  5-7.  The  continuous  variation  of  the  thresholds  versus  k  is 
shown  in  Fig.  5-8. 

5.2  THE  VARIATION  OF  ^ ,  a2,  Bj,  &2>  A^a^  a2) ,  A^cy  a2) ,  B^Bj,  B2> , 

B2(6r  AS  A  FUNCT10N  0F  c>  0 

Figures  5-9  and  5-10  present  the  variation  of  the  probabilities  of  error 

of  type  1  and  2  as  well  as  the  variation  of  the  thresholds  as  a  function  of 

the  observation  cost  c  for  fixed  p,  k,  o.  We  set  p  ■  0.5;  then  some  of  the 

local  minima  result  in  *  <*2  ■  Bj  “  82  when  the  minimization  of  Eq.  4-3  is 

initiated  with  ain  -  o4n  *  3in  *  gin.  thus.  Fig.  5-9  describes  the  variation 
12  12 

of  a  ,  a2,  Bj,  B2  as  a  function  of  c.  It  is  intuitively  expected  that  as  the 

cost  of  observations  increases  the  detectors  would  tend  to  take  less  obser¬ 

vations,  therefore,  the  quality  of  information  upon  which  their  decisions 
are  based  would  get  worse  with  increasing  c,  consequently  the  probabilities 
al’  a2’  ®i»  wou*c*  increase.  This  behavior  is  actually  shown  in  Fig.  5-9. 
The  variation  of  the  thresholds  with  c  is  expected  to  be  the  following:  As 
c  increases  the  detectors  would  tend  to  make  a  final  decision  more  quickly, 
consequently  the  areas  where  h  »  0  and  h  =  1  are  accepted  in  Fig.  4-1  will  get 
larger  with  increasing  c.  Hence  it  is  expected  that  as  c  increases,  the  lower 
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threshold  Ajta^,  a 2)  (b2(0j,  82)  )  will  increase  whereas  the  upper  threshold 
Aj(a  ,  a2^  ( ® j C B j »  02 ) )  will  decrease.  This  behavior  is  indeed  shown  in 
Fig.  5-10. 

Figures  5-11  and  5-12  present  the  variation  of  the  probabilities  of  error 

of  type  1  and  type  2  as  well  as  the  variation  of  the  thresholds  as  a  function 

of  the  noise  variance  o  for  fixed  p,  k,  c.  We  set  p  -  0.5;  then  some  of  the 

local  minima  result  in  a,  =  a„  *■  8,  *  8.  when  a*n  =*  a*n  **  8*n  =  0*n.  Such 

12  12  12  12 

minima  are  shown  in  Figs.  5-11  and  5-12.  It  is  intuitively  expected  that  as 
the  noise  variance  o  increases  the  quality  of  information  of  the  detectors 
gets  worse,  hence  the  probabilities  of  error  c^,  a2>  0^,  @2  increase.  This 
behavior  is  actually  shown  in  Fig.  5-11.  Notice  that  for  a  >  20  the  proba¬ 
bilities  of  error  ,  a2,  0^,  02  approach  the  value  0.5,  hence  for  o  >  20  the 
observations  provide  the  detectors  with  very  little  additional  information, 
thus  the  observations  are  practically  useless. 

As  far  as  the  behavior  of  the  thresholds  is  concerned,  we  can  expect  the 
following:  As  the  quality  of  observations  gets  worse,  the  detectors  would 
tend  to  rely  more  and  more  on  their  prior  information,  thus  they  would  tend 
to  make  decisions  more  quickly.  Consequently  as  a  increases,  the  areas  where 
h  ■  0  and  h  ■  1  are  accepted  would  tend  to  increase;  hence,  the  upper  thresh¬ 
old  AjCo^,  a2)  ( B j ( 0 ^ ,  02))  will  decrease  and  the  lower  threshold  A^a^,  a2) 
(B2(0j,  02))  will  increase  with  increasing  o.  This  behavior  of  the  thresh¬ 
olds  is  shown  in  Fig.  5-12.  As  noted  before,  for  o  >  20  the  observations  are 
practically  useless,  therefore  the  thresholds  A^o^,  a 2>  (b  (0j,  02>  )  and 
A2^ai’  a2^  B2^  approach  very  close  to  each  other  because  the  detec¬ 

tors  make  decisions  based  practically  on  their  prior  information. 
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So  far  the  results  presented  in  this  section  correspond  to  local  minima 
for  which  =  8^,  a2=  B2.  There  are  many  local  minima  (Eq.  4-3)  other  than 
the  symmetric  ones.  Below  we  present  such  a  solution. 

5.3  A  NONSYMMETRIC  SOLUTION 

When  the  initial  values  of  c^,  a 2>  8^,  B2  used  for  the  minimization  of 

Eq.  4-3  are  such  that  a^n  =  B^n  =  a*n  =  B^n  then  the  symmetric  local  minima 

112  2 

of  subsections  5.1  and  5.2  result.  However,  if  the  initial  values  of 

a,,  a  ,  B, ,  8~  used  in  the  minimization  of  Eq.  4-3  are  o*n  *  c^n  *  B*n  *  B^n 
i  z  i  i  12  12 

then  the  resulting  local  minimum  are  not  symmetric.  An  example  of  such  a 
local  minimum  is  given  below. 

For  p  *  0.9,  k  =  4,  c  =  0.05  and  initial  values 

ain  =  0.2  aln  «*  0.5  Sin  “  0.74  Bln  -  0.3 

12  1  2 

the  resulting  local  minimum  is 

a,  -  0.000164  a„  *  0.999457 

1  2 

-  0.1150  B2  -  0.5694 

and  the  thresholds  A^c^,  «2) ,  A^c^,  a2) ,  B^B^  B2>,  B^Bj,  82>  are 

A,  =  1.1972429  A„  =  -0.0003791 

1  2 

=  2.791737  B2  =»  -0.4410045  . 

It  is  expected  that  the  behavior  of  the  nonsymmetric  solutions  as  a  function  of 
p,  k,  c,  o  will  be  qualitatively  the  same  as  that  of  the  symmetric  solutions. 
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Figure  5-7.  Thresholds  versus  p  (k  -  6,  c  -  0.1,  o«  0.5) 
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SECTION  6 

SUMMARY  -  CONCLUSIONS 

In  this  report  we  presented  suboptlmal  solutions  for  two  sequential 
distributed  detection  problems:  (1)  The  Infinite  Horizon  Decentralized  Wald 
Problem  and  (2)  A  Sequential  Distributed  Detection  Problem  with  Communication 
and  ordered  stopping  times. 

The  suboptlmal  algorithm  proposed  for  the  solution  of  the  infinite  hori¬ 
zon  decentralized  Uald  problem  captures  the  basic  features  of  the  optimal 
solution  and  is  easy  to  Implement.  The  numerical  results  obtained  by  the 
proposed  algorithm,  and  presented  in  Section  5,  are  intuitively  appealing. 

When  communication  is  allowed  between  the  detectors,  the  resulting  dis¬ 
tributed  sequential  detection  problems  are  not  sequentially  decomposable  in 
general.  That  is  why  simple  optimal  solutions  are  valuable.  The  algorithm 
proposed  in  subsection  A. 2  is  simple  and  very  easy  to  Implement  as  it  requires 
the  same  amount  of  computation  as  the  algorithm  proposed  for  the  infinite 
horizon  decentallzed  Uald  problem. 

The  simplicity  of  the  proposed  algorithms  is  one  of  their  major  advan¬ 
tages.  A  performance  analysis  would  be  desirable  to  determine  how  close  to 
the  optimal  these  algorithms  perform. 
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